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QUANTILE  AND  PROBABILITY  CURVES  WITHOUT  CROSSING 

VICTOR  CHERNOZHUKOVt     IVAN  FERNANDEZ- VAL§     ALFRED  GALICHON* 

Abstract.  The  most  common  approach  to  estimating  conditional  quantile  curves  is  to  fit  a  curve, 
typically  linear,  pointwise  for  each  quantile.  Linear  functional  forms,  coupled  with  pointwise  fitting,  are 
used  for  a  number  of  reasons  including  parsimony  of  the  resulting  approximations  and  good  computa- 
tional properties.  The  resulting  fits,  however,  may  not  respect  a  logical  monotonicity  requirement  -  that 
the  quantile  curve  be  increasing  as  a  function  of  probability.  This  paper  studies  the  natural  monotoniza- 
tion  of  these  empirical  curves  induced  by  sampling  from  the  estimated  non-monotone  model,  and  then 
taking  the  resulting  conditional  quantile  curves  that  by  construction  are  monotone  in  the  probability. 
This  construction  of  monotone  quantile  curves  may  be  seen  as  a  bootstrap  and  also  as  a  monotonic  re- 
arrangement of  the  original  non-monotone  function.  It  is  shown  that  the  monotonized  curves  are  closer 
to  the  true  curves  in  finite  samples,  for  any  sample  size.  Under  correct  specification,  the  rearranged 
conditional  quantile  curves  have  the  same  asymptotic  distribution  as  the  original  non-monotone  curves. 
Under  misspecification,  however,  the  asymptotics  of  the  rearranged  curves  may  partially  differ  from  the 
asymptotics  of  the  original  non-monotone  curves.  An  analogous  procedure  is  developed  to  monotonize 
the  estimates  of  conditional  distribution  functions.  The  results  are  derived  by  establishing  the  compact 
(Hadamard)  diflferentiability  of  the  monotonized  quantile  and  probability  curves  with  respect  to  the 
original  curves  in  discontinuous  directions,  tangentially  to  a  set  of  continuous  functions.  In  doing  so, 
the  compact  differentiability  of  the  rearrangement-related  operators  is  established. 
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1.  Introduction  and  Discussion 

The  problem  studied  in  this  paper  can  be  best  described  using  Hnear  quantile  regres- 
sion as  the  prime  example  (Koenker,  2005).  Suppose  that  x'(3{u)  is  a  linear  approxima- 
tion to  the  u-quantile  Qo{u\x)  of  a  real  response  variable  Y,  given  a  vector  of  regressors 
X  =  X.  The  typical  estimation  methods  fit  the  conditional  curve  x'P{u)  pointwise  in 
u  G  (0, 1)  producing  an  estimate  x  (5{u).  Linear  functional  forms,  coupled  with  pointwise 
fitting,  are  used  for  a  number  of  reasons  including  parsimony  of  the  resulting  approxi- 
mations and  good  computational  properties  (Portnoy  and  Koenker,  1997).  However,  a 
problem  that  might  occur  is  that  the  map 

u  I— >  X  P{u) 

may  not  be  increasing  in  u,  which  violates  the  logical  monotonicity  requirement.  Another 
manifestation  of  this  issue,  known  as  the  "quantile  crossing  problem"  (He,  1997),  is  that 
the  conditional  quantile  curves  x  f-^  x' (3[u)  may  cross  for  different  values  of  u. 

In  the  analysis  we  shall  distinguish  the  following  two  cases,  each  leading  to  the  lack 
of  monotonicity  or  the  crossing  problem: 

(1)  Monotonically  correct  case:  The  population  curve  u  i— >  x'P{u)  is  increasing  in  u, 
and  thus  satisfies  the  monotonicity  requirement.  However,  the  empirical  curve 
u  H^-  x'P{u)  may  be  non-monotone  due  to  estimation  error. 

(2)  Monotonically  incorrect  case:  The  population  curve  u  <-^  x'P{u)  is  non-monotone 
due  to  imperfect  approximation  to  the  true  conditional  quantile  function.  Ac- 
cordingly, the  resulting  empirical  curve  u  i-^  x' (5{u)  is  also  non-monotone  due  to 
both  non-monotonicity  of  the  population  curve  and  estimation  error. 

Consider  the  random  variable 

Y^  :=  x%U)  where  U  ~  ?7(0, 1). 

This  variable  can  be  seen  as  a  bootstrap  draw  from  the  estimated  quantile  regression 
model,  as  in  Koenker  (1994),  and  has  the  distribution  function 

F{y\x)=  I   l{x'P{u)<y]du.  (1.1) 

JQ 
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Moreover,  inverting  the  distribution  function,  one  obtains  a  proper  quantile  function 

F-\u\x)  =  mi{y:F{y\x)>u},  (1.2) 

which  is  monotone  in  u.  The  rearranged  quantile  function  F~^{u\x)  coincides  with  the 
original  curve  x  (3{u)  if  the  original  curve  is  increasing  in  u,  but  differs  from  the  original 
curve  otherwise.  Thus,  starting  with  a  possibly  non-monotone  original  curve  u  h- >  x  /3{u), 
the  rearrangement  (1.1)-(1.2)  produces  a  monotone  quantile  curve  u  t— »  F^^{u\x).  In 
what  follows,  we  focus  our  attention  on  the  interval  (0, 1)  without  loss  of  generality. 
Indeed,  any  closed  subinterval  of  (0, 1)  can  also  be  considered  isomorphically  to  the 
treatment  of  the  unit  interval  case,  as  commented  further  in  Section  2. 

As  mentioned  above,  this  rearrangement  mechanism  has  a  direct  relation  to  the  quan- 
tile regression  bootstrap  (Koenker,  1994),  since  the  rearranged  quantile  curve  is  produced 
by  sampling  from  the  estimated  original  quantile  model.  Moreover,  both  the  mechanism 
and  its  name  have  a  direct  relation  to  rearrangement  maps  in  variational  analysis  and  op- 
erations research  (e.g..  Hardy,  Littlewood,  and  Polya,  1952,  and  Villani,  2003).  Further 
important  references  on  the  rearrangement  method  are  discussed  below. 

The  purpose  of  this  paper  is  to  establish  the  empirical  properties  of  the  rearranged 
quantile  curves  and  their  distribution  counterparts: 


u 


F  ^{u\x)    and   y  \-^  F{y\x), 


under  scenarios  (1)  and  (2).  The  paper  also  characterizes  certain  analytical  and  approx- 
imation properties  of  the  corresponding  population  curves: 

u  1-^  F~^{u\x)  =  inf{y  :  F{y\x)  >  u},    and  y  i-^  F{y\x)  =   /    l{x'P{u)  <  y}du. 

Jo 

The  first  main  result  of  the  paper  establishes  the  improved  estimation  properties  of 
the  rearranged  curves.  We  show  that  the  rearranged  curve  F~^{u\x)  is  closer  to  the  true 
conditional  quantile  curve  Qo{u\x)  than  the  original  curve.  Formally,  for  each  x,  we  have 
that  for  all  p  G  [l,oo] 

/  \Qo{u\x)  -  F-\u\x)\Pdu\       <(  f  \Qo{u\x)  -  x'p{u)\''du 


where  the  inequahty  is  strict  for  p  €  (l,C)o)  whenever  u  <—>  x  P{u)  is  decreasing  on 
a  subset  of  U  :=  (0, 1)  of  positive  Lebesgue  measure,  while  u  i-^  Qo{u\x)  is  strictly 
increasing.  This  property  is  independent  of  the  sample  size,  and  thus  continues  to  hold 
in  the  population,  and  also  regardless  of  whether  the  linear  quantile  estimator  x  P{u) 
estimates  Qo{u\x)  consistently  or  not,  i.e.  whether  Qq{u\x)  =  x'P{u)  or  Qo{u\x)  ^ 
x'P{u).  In  other  words,  the  rearranged  quantile  curves  have  smaller  estimation  error 
than  the  original  curves  whenever  the  latter  are  not  monotone.  This  is  a  very  important 
property  that  does  not  depend  on  the  way  the  quantile  model  is  estimated.  It  also  does 
not  rely  on  any  other  specifics  of  the  current  context  and  is  therefore  applicable  quite 
generally. 

Towards  describing  the  essence  of  the  rest  of  results,  let  us  fix  the  value  of  the  regressor 
X  to  x.  Suppose  that  P{u)  is  an  estimator  for  P{u)  that  converges  weakly  to  a  Gaussian 
process  G{u),  so  that 

Vnx'0{u)  -  Piu))  =4>  x'G{u),  (1.3) 

as  a  stochastic  process  indexed  by  u  in  the  metric  space  of  bounded  functions  £°°(0, 1). 
For  sufficient  conditions,  see,  for  example,  Gutenbrunner  and  Jureckova  (1992),  Portnoy 
(1991),  and  Angrist,  Chernozhukov,  and  Fernandez- Val  (2006). 

The  second  main  result  of  the  paper  is  that  in  the  monotonically  correct  case  (1), 

MF{y\x)  -  F{y\x))  =^  F'{y\x)[x'G{F{y\x))],  (1.4) 

as  a  stochastic  process  indexed  by  y  in  the  metric  space  i°°{y),  where  3^  is  the  support 
of  K-;  and 


'  XI 
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x'/3'(u) 
Moreover,  we  show  that 


^'(y|^)  = -T^TTT         ,,.'    ^ith   p'{u):= 


d(3{u) 


u=F{y\x)  du 


Vn{F-\u\x)  -  F-\u\x))  ^  x'G{u),  (1.5) 

as  a  stochastic  process  indexed  by  u,  in  i°°{0, 1);  which,  remarkably,  coincides  with  the 
first  order  asymptotics  (1.3)  of  the  original  curve.  This  result  has  a  convenient  practical 
implication:  if  the  population  curve  is  monotone,  then  the  empirical  non-monotone 
curve  can  be  re-arranged  to  be  monotonic  without  affecting  its  (first  order)  asymptotic 
properties.    To  derive  the  above  results  we  find  the  functional  Hadamard  derivatives 
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of  F{y\x)  and  F~^{u\x)  with  respect  to  perturbations  of  the  underlying  curve  x'P{u) 
in  discontinuous  directions,  tangentially  to  the  set  of  continuous  functions,  and  then 
use  the  functional  delta  method.  Establishing  the  Hadamard  differentiability  of  the 
rearranged  distribution  and  quantile  curves  in  discontinuous  directions  is  the  second 
main  theoretical  result  of  the  paper. 

The  third  main  result  is  that  in  the  monotonically  incorrect  case 

^™.) -.(„.,).  g'||M^.  (1.) 

as  a  stochastic  process  indexed  by  y  E  K,  in  i°°{K),  where  K  is  an  appropriate  set 
defined  in  the  next  section.  Here  Ui{y\x)  <  ...  <  UK{y\x){y\x)  are  solutions  to  the 
equation  y  =  x'P{u),  assuming  that  K{y\x)  is  bounded.  Similarly,  for  the  rearranged 
quantile  curve, 

(1.7) 
as  a  stochastic  process  indexed  by  u  G  A'',  in  i°°{K'),  where  K'  is  an  appropriate  set 
defined  in  the  next  section. 

Analogously  to  quantiles,  most  estimation  methods  for  conditional  distribution  func- 
tions do  not  impose  monotonicity,  and  therefore  can  give  rise  to  non-monotonic  empirical 
conditional  distribution  curves;  see,  for  example.  Hall,  Wolff,  and  Yao  (1999).  A  similar 
monotone  rearrangement  can  be  applied  to  these  distribution  curves  by  exchanging  the 
roles  plaj^ed  by  the  quantile  and  the  probability  spaces.  Thus,  suppose  that  P{y\x)  is  a 
candidate  estimate  of  a  conditional  distribution  function,  which  is  not  monotone  in  y. 
The  rearranged  monotone  quantile  curve  associated  with  P{y\x)  is 

/■oo  ^0 

Q{u\x)=  l{P{y\x)<u]dy-  \{P  {y\x)  >  u]dy. 

Jo  J-oo 

The  rearranged  probability  curve  can  then  be  obtained  as  the  inverse  of  this  quantile 
curve,  i.e., 

F{y\x)  =  inf  ju  :  Q  {u\x)  >  yj  , 


which  is  monotone  by  construction.  Section  3  shows  in  more  detail  that  similar  improved 
estimation  properties  and  an  asymptotic  distribution  theory  goes  through  for  Q  and  F. 

The  distributional  results  in  the  paper  do  not  rely  on  the  sampling  properties  of  the 
particular  estimation  method  used,  because  they  are  expressed  in  terms  of  the  differ- 
entiability of  the  operator  with  respect  to  the  basic  estimated  process.  Moreover,  the 
results  that  follow  are  derived  without  imposing  linearity  of  the  functional  forms.  The 
only  conditions  required  are  that  (1)  a  central  limit  theorem  like  (1.3)  applies  to  the 
estimator  of  the  curve,  and  (2)  the  population  curves  have  some  smoothness  properties. 
The  exact  nature  of  these  population  curves  does  not  affect  the  validity  of  the  results. 
For  example,  the  results  hold  regardless  of  whether  the  underlying  model  is  an  ordinary 
or  an  instrumental  quantile  regression  model. 

There  exist  other  methods  to  obtain  monotonic  fits  based  on  quantile  regression. 
He  (1997),  for  example,  proposes  to  impose  a  location-scale  regression  model,  which 
naturally  satisfies  monotonicity.  This  approach  is  fruitful  for  location-scale  situations, 
but  in  numerous  cases  data  do  not  satisfy  the  location-scale  model,  as  discussed,  for 
example,  in  Lehmann  (1974),  Doksum  (1974),  and  Koenker  (2005).  Koenker  and  Ng 
(2005)  develop  a  computational  method  for  quantile  regression  that  imposes  the  non- 
crossing  constraints  in  simultaneous  fitting  of  quantile  curves.  This  approach  may  be 
fruitful  in  many  situations,  but  the  statistical  properties  of  the  method  remain  unknown. 
Clearly,  Koenker  and  Ng's  proposal  is  different  from  the  rearrangement  method. 

The  distributional  results  obtained  in  the  paper  can  also  be  viewed  as  a  functional 
delta  method  for  the  rearrangement-related  operators  (1.1)  and  (1.2)  that  include  the 
inverse  (quantile)  operators  as  a  special  case.  In  this  sense,  they  extend  the  previous 
results  by  Gill  and  Johansen  (1990),  Doss  and  Gill  (1992),  and  Dudley  and  Norvaisa 
(1999)  on  compact  differentiability  of  the  quantile  operator.  The  main  technical  difficulty 
here,  as  well  as  in  the  quantile  case,  is  that  differentiability  needs  to  be  established  in 
discontinuous  directions  (that  converge  to  continuous  directions,  i.e.,  tangentially  to 
the  set  of  continuous  functions),  because  the  empirical  perturbations  of  the  quantile 
processes  are  typically  step  functions. 

Both  the  statistical  and  mathematical  results  of  this  paper  complement  the  important 
work  of  Dette,  Neumeyer,  and  Pilz  (2006),  which  applies  the  rearrangement  operators  to 


kernel  mean  regressions,  in  order  to  obtain  mean  regression  functions  that  are  monotonic 
in  the  regressors.  Our  results  on  Hadamard  differentiability  in  discontinuous  directions 
•are  new.  They  complement  the  local  expansions  in  smooth  directions  subsumed  in  the 
proofs  in  Dette  et.  al.  for  the  case  called  here  the  monotonically  correct  case.  In 
addition,  our  results  cover  monotonically  incorrect  cases.  The  statistical  problem  also 
differs  quite  substantially.  The  mathematical  results  of  this  paper  also  complement  the 
results  on  directional  differentiability  of  Li-  functionals  of  rearranged  functions  like  (1.2) 
by  Mossino  and  Temam  (1981).  The  results  for  Li -functionals  do  not  imply  the  main 
results  of  this  paper,  such  as  (1.4)-(1.7),  but  the  converse  is  shown  to  be  true.  (See 
discussion  after  Proposition  4  in  Section  2  for  more  details.) 

There  are  many  potential  applications  of  the  estimation  and  differentiability  results  of 
the  paper  to  objects  other  than  probability  or  quantile  curves.  For  example,  in  a  com- 
panion work  we  present  applications  to  economic  demand  and  production  functions  and 
to  biometric  growth  curves,  where  monotonization  is  used  to  impose  useful  theoretical 
or  logical  restrictions  (Chernozhukov,  Fernandez- Val,  and  Galichon,  2006a). 

We  organize  the  rest  of  the  paper  as  follows.  In  Section  2.1  we  describe  some  basic 
analytical  properties  of  the  rearranged  population  curves.  In  Section  2.2  we  derive  the 
functional  differentiability  results.  In  Section  2.3  we  present  estimation  properties  of 
the  rearranged  curves  and  establish  their  limit  distributions.  In  Section  3  we  extend 
the  previous  results  to  monotonize  estimates  of  distribution  curves.  In  Section  4.1  we 
illustrate  the  rearrangement  procedure  with  an  empirical  application,  and  in  Section  4.2 
we  provide  a  Monte-Carlo  example.  In  Section  5  we  conclude  with  a  summary  of  the 
main  results. 


2.  Rearranged  Quantile  Curves:  Analytical  and  Empirical  Properties 

In  this  section  the  treatment  of  the  problem  is  somewhat  more  general  than  in  the  in- 
troduction. In  particular,  we  replace  the  linear  functional  form  x'/3{u)  by  Q{u\x).  Define 
y^  :=  Q{U\x),  where  U  ~  Uniform(W)  with  U  =  (0,1).  Let  F{y\x)  :=  J^  l{Q{u\x)  < 
y]du  be  the  distribution  function  of  Y^^  and  F~^{u\x)  :=  inf{y  :  F{y\x)  >  u}  be  the 
quantile  function  oiYx. 


Remark.  We  consider  the  interval  (0, 1)  without  loss  of  generality.  Indeed,  suppose 
we  are  interested  in  a  particular  subinterval  {a,  a  +  b)  of  (0,1).  For  example,  we  may 
wish  to  focus  estimation  on  a  particular  range  of  quantiles  or  to  avoid  estimation  of  tail 
quantiles.  For  this  purpose,  we  define  all  objects  conditionally  on  the  event  U  €  (a,  a+b): 
Y^  :=  Q{U\x)  =  Q{a  +  bU\x),  where  U  ~  C/(0, 1),  F{y\x)  :=  /^  l{Q{u\x)  <  y}du,  and 
F~^{u\x)  :=  inf{y  :  F{y\x)  >  ii}  for  u  G  (0, 1).  The  analysis  of  the  paper  applies  to  the 
functions  Q  and  F"^.  In  order  to  go  back  to  the  unconditional  quantities,  we  can  use 
the  transformations  Q{u\x)  =  Q{{u  —  a)/b\x)  for  u  G  (a,  a  +  b)  and  F{y\x)  =  a  +  bF{y\x) 
for  y  G  {Q{u\x)  :  u  E  {a,a  +  b)}. 

2.1.  Basic  Analytical  Properties.  We  start  by  developing  some  basic  properties  for 
F{y\x)  and  F~^{u\x),  the  population  counterparts  of  the  rearranged  distribution  curve 
and  its  inverse.  We  need  these  properties  to  derive  various  empirical  properties  stated 
in  the  next  section. 

Recall  first  the  following  definitions  from  Milnor  (1965):  Let  g  :  U  C  R  ^  M.  he  a 
continuously  differentiable  function.  A  point  u  eU  is  called  a  regular  point  of  g  if  the 
derivative  of  g  at  this  point  does  not  vanish,  i.e.,  g'  (u)  7^  0.  A  point  u  which  is  not  a 
regular  point  is  called  a  critical  point.  A  value  y  E  g  {U)  is  called  a  regular  value  of  g 
if  ^"^  ({y})  contains  only  regular  points,  i.e.,  if  Vu  G  g~^  ({y}),  9'  (^)  7^  0.  A  value  y 
which  is  not  a  regular  value  is  called  a  critical  value. 

Denote  by  y^  the  support  of  Y^,  yX  :=  {{y,x)  :  y  Ey^^x  E  X},  and  UX  ■=14  x  X. 
We  assume  throughout  that  y^;  C  y,  which  is  compact  subset  of  R,  and  that  x  E  X,  a. 
compact  subset  of  R''.  In  some  applications  the  curves  of  interest  are  not  functions  of 
x,  or  we  might  be  interested  in  a  particular  value  x.  In  this  case,  the  set  X  is  taken  to 
be  a  singleton  X  =  {x}.  We  make  the  following  assumptions  about  Q{u\x): 

(a)  Q{u\x)  :  U  X  X  -^  R  is  a  continuously  differentiable  function  in  both  arguments, 

(b)  For  each  x  E  X,  the  number  of  elements  of  {u  E  U  \  Q'{u\x)  =  0}  is  finite  and 
uniformly  bounded  on  x  E  X. 

Assumption  (b)  implies  that,  for  each  x  E  X,  Q'{u\x)  is  not  zero  almost  everywhere  on 
U  and  can  only  switch  sign  a  bounded  number  of  times.  Let  y*  be  the  subset  of  regular 
values  oiu>-^  Q{u\x)  in  3^^,  and  yX*  :=  {{y,x)  :  y  Ey*,x  E  X}. 
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Proposition  1  (Basic  Properties  of  F{y\x)  and  F~^{u\x)).  Under  assumptions  (a)  - 
(b),  the  functions  F{y\x)  and  F~^{u\x)  satisfy  the  following  properties: 

1.  The  set  of  critical  values,  yx  \  yi,  is  finite,  and  Jyxy.  dF{y\x)  =  0. 

2.  For  any  y  e  y* 

K{y\x) 

F{y\x)  =   Y^  sign{Q'{ukiy\x)\x)}uk{y\x)  +  l{Q'{uK{y\x){y\x)\x)  <  0}, 
fc=i 

where  {ufc(y|x),  fork  =  l,...,K{y\x)  <  oo}  are  the  roots  of  Q{u\x)  =  y  in  increasing 
order. 

3.  For  any  y  E  y*,  the  ordinary  derivative  f{y\x)  =  dF{y\x)/dy  exists  and  takes  the 
form 

^^^1^^  =    IJ    \Q'{u,{y\x)\x)y 

which  is  continuous  at  each-  y  €  3^*.  For  any  y  &  y  \  y^,  set  f{y\x)  :=  0.  F{y\x)  is 
absolutely  continuous  and  strictly  increasing  in  y  E  yx-  Moreover,  f{y\x)  is  a  Radon- 
Nikodym  derivative  of  F{y\x)  with  respect  to  the  Lebesgue  measure. 

4-   The  quantile  function  F~^{u\x)  partially  coincides  with  Q{u\x);  namely 

F-\u\x)  =  Q{u\x), 

provided  that  Q{u\x)  is  increasing  at  u,  and  the  equation  Q{u\x)  =  y  has  unique  solution 
for  y  =  F~^{u\x). 

5.  The  quantile  function  F~^{u\x)  is  equivariant  to  location  and  scale  transformations 
ofQ{u\x). 

6.  The  quantile  function  F~^{u\x)  has  an  ordinary  continuous  derivative 

l/f{F-\u\x)\x), 

when  F~^{u\x)  G  3^*.  This  function  is  also  a  Radon-Nikodym  derivative  with  respect  to 
the  the  Lebesgue  measure. 

7.  The  map  {y,x)  i— >  F{y\x)  is  continuous  on  yX  and  the  map  {u,x)  i-^  F''^{u\x)  is 
continuous  on  lAX . 
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The  following  simple  example  illustrates  some  of  these  basic  properties  in  a  situation 
where  the  initial  population  pseudo-quantile  curve  is  highly  non-monotone.  Consider 
the  following  pseudo-quantile  function: 

Q{u)  =  5  (u  +  -  sm{2TTu)]  .  (2.1) 

The  left  panel  of  Figure  1  shows  that  this  function  is  non-monotone  in  [0, 1].  In  par- 
ticular, the  slope  of  Q{u)  changes  sign  twice  at  1/3  and  2/3.  The  rearranged  quantile 
curve  F~^{u),  also  plotted  in  this  panel,  is  continuous  and  monotonically  increasing. 
The  results  1,  2,  4  and  7  of  the  proposition  are  illustrated  in  the  right  panel  of  Figure 
1,  which  plots  the  original  and  rearranged  distribution  curves.  Here  we  can  see  that 
the  rearranged  distribution  function  is  continuous,  does  not  have  mass  points,  and  co- 
incide with  the  original  curve  for  values  of  y  where  the  original  curve  is  one  to  one  and 
increasing. 

Figure  2  illustrates  the  third  and  sixth  results  of  the  proposition  by  plotting  the 
sparsity  function  for  F~^{u)  and  the  density  function  of  F{y).  The  derivative  of  F{y) 
in  the  right  panel  is  continuous  at  the  regular  values  of  Q{u).  Similarly,  the  sparsity 
function  for  F~^{u)  in  the  left  panel  is  continuous  at  the  corresponding  image  values 
(under  F{y)). 

2.2.  Functional  Derivatives.  Next,  we  establish  the  main  results  of  the  paper  on 
Hadamard  differentiability  of  F{y\x)  and  F~^{u\x)  with  respect  to  Q{u\x),  tangentially 
to  the  space  of  continuous  functions  on  UX.  This  differentiability  property  is  important 
for  deriving  the  asymptotic  distributions  of  the  rearranged  estimates.  In  particular,  the 
property  allows  us  to  establish  generic  convergence  results  for  rearranged  curves  based  on 
any  initial  quantile  estimator,  provided  the  initial  estimator  satisfies  a  functional  central 
limit  theorem.  The  property  also  imphes  that  the  bootstrap  is  valid  for  performing 
inference  on  the  rearranged  estimates,  provided  the  bootstrap  is  valid  for  the  initial 
estimates.  This  result  follows  from  the  functional  delta  method  for  the  bootstrap  (e.g., 
Theorem  13.9  in  van  der  Vaart,  1998). 

In  what  follows,  £°°{UX)  denotes  the  set  of  bounded  and  measurable  functions  h  : 
UX  — >  R,  C{UX)  denotes  the  set  of  continuous  functions  mapping  h  :  UX  -^  R,  and 
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Figure  1.  Left:  The  pseudo-quantile  function  Q{u)  and  the  rearranged 
quantile  function  F^^{u).  Right':  The  pseudo-distribution  function  Q~^{y) 
and  the  rearranged  distribution  function  F{y). 


,(y, 

;     - 

w 

Figure  2.  Left:  The  density  (sparsity)  function  of  the  rearranged  quan- 
tile function  F~^{u).  Right:  The  density  function  of  the  rearranged  dis- 
tribution function  F{y). 
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i^{UX)  denotes  the  set  of  measurable  functions  h  :  UX  -^  M  such  that  /^  /^  \h{u\x)\dudx 
<  oo,  where  du  and  dx  denote  the  integration  with  respect  to  the  Lebesgue  measure  on 
^  and  A",  respectively. 

Proposition  2  (Hadamard  Derivative  of  F{y\x)  with  respect  to  Q{u\x)).  Define  F{y\x,  ht) 
:=  /q  \{Q{u\x)  +  tht{u\x)  <  y}du.   Under  assumptions  (a)-(h),  as  t  ^^  0, 

DM.,t)^!M^hLzim^O,(yM,  (22) 

DMr)  :=  -  Y,   I'r^r'r'li-  (2.3) 

^   \Q'{uk{y\x)\x)\ 

The  convergence  holds  uniformly  in  any  compact  subset  ofyX*  :=  {(y,a:)  :  j/  G  3^*,x  6 
X),  for  every  \ht  -  h\oo  ^  0,  where  ht  G  ^°°  {UX),  and  h  E  C{UA:). 

Proposition  3  (Hadamard  Derivative  of  F~^{u\x)  with  respect  to  Q{u\x)).  Under  as- 
surnpitions  (a)-(h),  as  t  ^  0, 

^>,Mx.O:^^"''"l''''">,-^"''"l"'^i).M.).  (2.4) 

P>W:=-^(^_.'„|^)|^)^P.(F-'M..)W.  (2.5) 

The  convergence  holds  uniformly  in  any  compact  subset  ofUX*  =  {{u,x)  :  {F~^  {u\x) ,  x)  6 
yX*},  for  every  \ht  -  hU  ^  0,  where  ht  E  i°°  {UX),  and  h  G  C{UX). 

The  convergence  results  hold  uniformly  on  regions  that  exclude  the  critical  values  of 
the  mapping  u  h^  Q{u\x).  At  the  critical  values,  Q{u\x)  possibly  changes  from  increasing 
to  decreasing.  Moreover,  in  the  monotonically  correct  case  (1),  the  following  result  is 
worth  emphasizing; 

Corollary  1  (Monotonically  correct  case).  Suppose  u  k^  Q{u\x)  has  Q'{u\x)  >  0,  for 
each  {u,x)  G  UX,  then  yX*  =  yX  and  UX*  =  UX.  Therefore,  the  convergence  in 
Propositions  2  and  3  holds  uniformly  over  the  entire  yX  and  UX,  respectively.  More- 
over, Dh{u\x)  =  h,  i.e.,  the  Hadamard  derivative  of  the  rearranged  quantile  with  respect 
to  the  original  curve  is  the  identity  operator. 
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The  convergence  is  uniform  over  the  entire  domain  in  the  monotonically  correct  case. 
This  result  raises  naturally  the  question  of  whether  uniform  convergence  can  be  achieved 
by  some  operation  of  smoothing  in  the  monotonically  incorrect  case  -  namely  integrating 
either  over  y  (or  over  u).  The  answer  is  indeed  yes. 

The  following  proposition  calculates  the  Hadamard  derivative  of  the  following  func- 
tionals  obtained  by  integration; 

{y',x)^   /  l{y  <y'}g{y\x)F{y\x)dy,    {u',x)^  /  \{u  <u']g{u\x)F~\u\x)du, 
Jy  JU 

with  the  restrictions  on  g  specified  below.     These  elementary  functionals  are  useful 

building  blocks  for  various  statistics,  as  briefly  mentioned  in  the  next  section. 

Proposition  4.   The  following  results  are  true  with  the  limits  being  continuous  on  the 
specified  domains: 

1.       f  \{y  <  y'}g{y\x)Dh,{y\^,t)dy-^_  [  l{y  <  y'}g{y\x)Dh{y\x)dy 
Jy  Jy 

uniformly  in  (y',  x)  €  yX,  for  any  g  G  i°°{yPt!)  such  that  x  i— >  g{y\x)  is  continuous  for 
a.e.  y. 

2.       /   l{u  <  u'}g{u\x)Dfn{u\x,t)du -^    /  l{u  <  u'}g{u\x)Dh{u\x)du 
Ju  JU 

uniformly  in  {u',x)  6  UA!,  for  any  g  E  d-^iJAX)  such  that  x  (-+  g[u\x)  is  continuous  for 

a.e.  u. 

This  proposition  essentially  is  a  corollary  of  Propositions  2  and  3.  Indeed,  the  results 
(l)-(2)  follow  from  the  fact  that  the  pointwise  convergence  of  Propositions  2  and  3, 
coupled  with  the  uniform  integrability  shown  in  Lemma  3  in  the  Appendix,  permits  the 
interchange  of  limits  and  integrals.  An  alternative  way  of  proving  result  (2),  but  not  any 
other  result  in  the  paper,  can  be  based  on  exploiting  the  convexity  of  the  functional  in 
(2)  with  respect  to  the  underlying  curve,  following  the  approach  of  Mossino  and  Temam 
(1981),  and  Alvino,  Lions,  and  Trombetti  (1989).  Due  to  this  limitation,  we  do  not 
pursue  this  approach  in  this  paper.  However,  details  of  this  approach  are  described 
in  Chernozhukov,  Fernandez- Val,  and  Galichon  (2006b)  with  an  application  to  some 
nonparametric  estimation  problems. 
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It  is  also  worth  emphasizing  the  properties  of  the  following  smoothed  functionals.  For 
a  measurable  function  /  :  R  h->-  R  define  the  smoothing  operator  as 

Sf{y):=  jh{y-y')f{y')dy\  (2.6) 

where  ks{v)  =  l{\v\  <  5}/25  and  5  >  0  is  a  fixed  bandwidth.  Accordingly,  the  smoothed 
curves  SF{y\x)  and  5F~^(u|x)  are  given  by 

SF{y\x)  :=   f  ks{y  -  y')F{y'\x)dy\       SF-\u\x)  :=  j  ks{u  -  i^)F-\u'\x)dv! . 

Since  these  curves  are  merely  formed  as  differences  of  the  elementary  functionals  in 
Proposition  4,  followed  by  a  division  by  5,  the  following  corollary  is  immediate. 

Corollary  2.  We  have  that  SDhi{y\x,t)  — >  SDh{y\x)  uniformly  in  {y,x)  G  J^^Y,  and 
SDht{u\x,t)  -^  SDh{u\x)  uniformly  in  {u,x)  e  UX. 

Note  that  smoothing  accomplishes  uniform  convergence  over  the  entire  domain,  which 
is  a  good  property  to  have  from  the  perspective  of  data  analysis. 

2.3.  Empirical  Properties  of  F{y\x)  and  F~^{u\x).  We  are  now  ready  to  state  the 
main  results  for  this  section. 

Proposition  5  (Improvement  in  Estimation  Property  Provided  by  Rearrangement). 
Suppose  that  Q{'\-)  is  an  estimator  (not  necessarily  consistent)  for  some  true  quantile 
curve  Qo{-\-).  Then,  the  rearranged  curve  F^^{u\x)  is  closer  to  the  true  curve  than 
Q{u\x)  in  the  sense  that,  for  each  x  ^  X , 

\Qo{u\x)  -  F-\u\x)\Pduj       <U\QQ{u\x)-Q{u\x)\Pdu]       ,pe[l,oo], 

where  the  inequality  is  strict  for  p  G  (l,oo)  whenever  Q{u\x)  is  decreasing  on  a  subset 
oflA  of  positive  Lebesgue  measure,  while  Qo{u\x)  is  increasing  on  U  . 

The  above  property  is  independent  of  the  sample  size  and  of  the  way  the  estimate  of 
the  curve  is  obtained,  and  thus  continues  to  hold  in  the  population. 

This  proposition  establishes  that  the  rearranged  quantile  curves  have  smaller  estima- 
tion error  than  the  original  curves  whenever  the  latter  are  not  monotone.  This  is  a  very 
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important  property  that  does  not  depend  on  the  way  the  quantile  model  is  estimated. 
It  also  does  not  rely  on  any  other  specifics  and  is  thus  applicable  quite  generally. 

The  following  proposition  investigates  the  asymptotic  distributions  of  the  rearranged 
curves. 

Proposition  6  (Empirical  Properties  of  {y,x)  ^  F{y\x)  and  {u,x)  i-^  F"^(u|x)). 
Suppose  that  Q{-\-)  is  an  estimator  for  Q{-\-)  that  takes  its  values  in  the  space  of  bounded 
measurable  functions  defined  on  UX,  and  that,  in  ^°°{UX), 

^/n{Q[u\x)  —  Q{u\x))  ^  G{u\x), 

as  a  stochastic  process  indexed  by  {u,x)  G  UX,  where  {u,x)  i-^  G{u\x)  is  a  Gaussian 
process  with  continuous  paths.  Assume  also  that  Q{u\x)  satisfies  the  basic  conditions  (a) 
and  (b).   Then  in  i°°{K),  where  K  is  any  compact  subset  ofyX*, 

^{F{y\x)  -  F{y\x))  ^  DG{y\x) 

as  a  stochastic  process  indexed  by  (y,  x)  G  yX* ;  and  in  1°°1JAX k),  with  UXk  =  {(^i  x)  : 
{F-\u\x),x)^K], 

^/n{F-\u\x)- F-^{u\x))^Dg{u\x), 

as  a  stochastic  process  indexed  by  {u,x)  G  UXk- 

Corollary  3  (Monotonically  correct  case).  Suppose  u  \-^  Q{u\x)  has  Q'{u\x)  >  0 
for  each  (u,x)  G  UX,  then  yX*  =  yX  and  UX*  =  UX.  Accordingly,  the  con- 
vergence in  Proposition  5  holds  uniformly  over  the  entire  yX  and  UX.  Moreover, 
Dg{u\x)  =  G{u\x),  i.e.,  the  rearranged  quantile  curves  have  the  same  first  order  as- 
ymptotic distribution  as  the  original  quantile  curves. 

Thus,  in  the  monotonically  correct  case,  the  first  order  properties  of  the  rearranged 
and  initial  quantile  estimates  coincide.  Hence,  all  the  inference  tools  that  apply  to 
original  quantile  estimates  also  apply  to  the  rearranged  quantile  estimates.  In  particular, 
if  the  bootstrap  is  valid  for  the  original  estimate,  it  is  also  valid  for  the  rearranged 
estimate,  by  the  functional  delta  method  for  the  bootstrap.  In  the  empirical  example 
of  Section  4,  we  exploit  this  useful  property  to  construct  uniform  confidence  bands  for 
the  conditional  quantile  functions  based  on  the  rearranged  quantile  function  estimates. 
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In  addition  to  tlie  results  on  quantile  function  estimates,  Proposition  6  provides  the 
asymptotic  properties  of  the  distribution  function  estimates.  The  preceding  remark 
about  the  vahdity  of  bootstrap  applies  also  to  these  estimates. 

In  the  monotonically  incorrect  case,  the  large  sample  properties  of  the  rearranged 
quantile  estimates  differ  from  those  of  the  initial  quantile  estimates.  Proposition  6 
enables  us  to  perform  inferences  for  rearranged  curves  in  this  case,  including  by  the 
bootstrap,  but  only  after  excluding  certain  nonregular  neighborhoods  (for  the  distribu- 
tion estimates,  the  neighborhood  of  the  critical  values  of  the  map  u  t— >  Q{u\x),  and,  for 
the  rearranged  quantile  estimates,  the  image  of  the  latter  neighborhood  under  F{y\x)). 
However,  if  we  consider  the  following  linear  functionals  of  the  rearranged  quantile  and 
distribution  estimates: 

{y',x)^    /  l{y  <  y'}g{y\x)F{y\x)dy,    {u,x)^    /  l{u  <  u'}g{u\x)F~\u\x)du, 
Jy  Ju 

then  we  no  longer  need  to  exclude  the  nonregular  neighborhoods.  The  following  propo- 
sition describes  the  empirical  properties  of  these  functionals  in  large  samples. 

Proposition  7  (Empirical  Properties  of  Integrated  Curves).  Under  the  conditions  of 
Proposition  6,  the  following  results  are  true  with  the  limits  being  continuous  on  the 
specified  domains: 

I.     V^  [  l{y  <  y'}g{y\x){F{y\x)  -  F{y\x))dy  ^    I  \{y  <  y']g{y\x)DG{y\x)dy, 
Jy  Jy 

as  a  stochastic  process  indexed  by  {y',x)  G  yX,  in  i°°{yX). 

2.     ^/n       l{u  <  u'}g{u\x){F-'^{u\x)  -  F-\u\x))du  =»       l{u  <  u'}g{u\x)DG{u\x)du, 
Ju  Ju 

as  stochastic  process  indexed  by  {u',x)  G  UX,  in  i°°{UX). 

The  restrictions  on  the  function  g  are  the  same  as  in  Proposition  4- 

The  linear  functionals  defined  above  are  useful  building  blocks  for  various  statistics, 
such  as  partial  means,  various  moments,  and  Lorenz  curves.  For  example,  the  conditional 
Lorenz  curve  is 


L{u\x)  =  (  /  l{f  <  u}F-\t\x)dt'^/(^  I  F-\t 


x)dt 
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which  is  a  ratio  of  a  partial  mean  to  the  mean.  Hadamard  differentiability  of  these 
statistics  with  respect  to  the  underlying  Q{u\x)  immediately  follows  from  the  Hadamard 
differentiability  of  the  elementary  functionals  of  Proposition  7  by  means  of  the  chain 
rule.  Therefore,  the  asymptotic  distribution  of  these  statistics  can  be  determined  from 
the  asymptotic  distribution  of  the  linear  functionals,  by  the  functional  delta  method. 
In  particular,  the  validity  of  the  bootstrap  for  these  functionals  is  preserved  by  the 
functional  delta  method  for  the  bootstrap. 

We  next  consider  the  empirical  properties  of  the  smoothed  curves  obtained  by  applying 
the  hnear  smoothing  operator  S  defined  in  (2.6)  to  F{y'\x)  and  F~^{u\x): 

SF{y\x)  :=  Jh{y  -  y')F{y'\x)dy',       SF-\u\x)  :=  J  ks{u  -  u')F-\u'\x)du'. 

The  following  corollary  immediately  follows  from  Corollary  2  and  the  functional  delta 
method. 

Corollary  4  (Large  Sample  Properties  of  Smoothed  Curves).  Under  the  conditions  of 
Proposition  6,  in  ^°°{yM), 

MSF{y\x)  -  SF{y\x))  ^  S[Dc{y\x)], 

as  a  stochastic  process  indexed  by  {y,x)  E  yX,  and  in  ^°°{UX), 

^/7^{SF-\u\x)  -  SF-\u\x))  ^  S[bG{u\x)l 

as  a  stochastic  process  indexed  by  {u,x)  E  UX. 

Thus,  inference  on  the  smoothed  rearranged  estimates  can  be  performed  without 
excluding  nonregular  neighborhoods,  which  is  convenient  for  practice.  Furthermore, 
validity  of  the  bootstrap  for  the  smoothed  curves  follows  by  the  functional  delta  method 
for  the  bootstrap. 

3.  Theory  of  Rearranged  Distribution  Curves 

The  rearrangement  method  can  also  be  applied  to  rearrange  cumulative  distribution 
curves  monotonically  by  exchanging  the  roles  of  the  quantile  and  probability  spaces. 

There  are  several  situations  where  one  might  be  faced  with  the  problem  of  non- 
increasing  empirical  distribution  curves.     In  an  option  pricing  context,  for  example, 


18 

Ait-Sahalia  and  Duarte  (2003)  use  market  data  to  estimate  a  risk-neutral  distribution. 
Estimation  error  may  cause  the  resulting  distribution  function  to  be  non-monotonic.  In 
other  cases  the  distribution  curve  is  obtained  by  some  inverse  transformation  and  local 
non-monotonicity  comes  as  an  artefact  of  the  regularization  technique.  In  other  situa- 
tions the  particular  estimation  technique  may  not  respect  monotonicity  (see,  e.g.,  Hall, 
Wolff,  and  Yao,  1999).  We  present  an  alternative  solution  to  this  problem  that  uses  the 
rearrangement  method. 

Here  we  do  not  present  the  conditional  case  for  notational  convenience.  All  derivations 
for  conditional  distributions,  however,  are  exactly  parallel  to  those  presented  in  this 
section.  Suppose  we  have  y  h^  P{y)  as  a  candidate  empirical  probability  distribution 
curve,  which  does  not  necessarily  satisfy  monotonicity,  with  population  counterpart 
P{y).  Define  the  following  quantile  curve 

/•oo  rQ 

Q[u)  =  l{P{y)<  u}dy  -  l{P{y)>  u}dy, 

JQ  J -oo 

which  is  monotone.  In  what  follows,  we  further  assume  that  the  support  of  P{y)  is 
y  C  [0,  +oo),  so  that  the  second  term  drops  out  (otherwise  it  can  be  treated  analogously 
to  the  first). 

The  inverse  of  the  quantile  curve  is  the  rearranged  probabihty  curve 


F{y)  =  M{y:Q{u)>y}, 


which  is  also  monotone  by  construction.  It  should  be  clear  at  this  point  that  the  quan- 
tities Q  and  F  are  exactly  symmetric  to  F  and  Q  in  the  quantile  case. 

The  following  improved  approximation  property  is  true  for  F:  Let  Fo{y)  be  the  true 
distribution  function,  then  for  all  p  G  [l,oo], 

(J  |Fo(y)  -  F{y)\Pdy^    '  <  (j^  \Fo{y)  -  P{y)\''dy^      , 

where  the  inequality  is  strict  for  p  E  (l,cx))  whenever  the  integral  on  the  right  is  finite 
and  y  h^  P{y)  is  decreasing  on  a  subset  of  positive  Lebesgue  measure,,  while  Fo(u)  is 
strictly  increasing.  This  property  is  independent  of  the  sample  size,  and  thus  continues 
to  hold  in  the  population. 
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In  the  monotonically  correct  case,  that  is  when  P'  {Q{u))  >  0  for  all  u  G  [0, 1],  if  the 
empirical  distribution  curve  P{y)  satisfies 

V^[Piy)~P{y))^G{y) 
in  i°°{y),  where  G  is  a  Gaussian  process,  then 

in£°°([0,l]),and,  in  e^{y), 

V^(F(y)-F(y))^G(y).  (3.1) 

Results  paralleling  those  of  the  previous  section  also  follow  for  the  monotonically 
incorrect  case.  In  particular,  we  have 

^^"^G(y,(.)) 


^/^  {q{u)  -  Q{u))  =^  E  ^ 

fc=i 

in  £°°([0, 1]),  and 


{ykH)\ 


' K(u)  \    '■*  K(u) 


^(^<"'-'^')-(1f(^)  twmk^ 


(3.2) 

u=P[y) 

in  ^°°{y),  where  {yk{u),  for  A;  =  1, ...,  K{u)]  are  the  roots  of  P{y)  =  u,  assuming  K{u) 
is  bounded  uniformly  in  u. 

4.  Illustrative  Examples 

4.1.  Empirical  Example.  To  illustrate  the  practical  applicability  of  the  rearrangement 
method,  we  consider  the  estimation  of  expenditure  curves.  We  use  the  original  Engel 
(1857)  data,  from  235  budget  surveys  of  19th  century  working-class  Belgium  households, 
to  estimate  the  relationship  between  food  expenditure  and  annual  household  income  (see 
Koenker,  2005).  Ernst  Engel  originally  presented  these  data  to  support  the  hypothesis 
that  food  expenditure  constitutes  a  declining  share  of  household  income  (Engel's  Law). 

In  Figure  3,  we  show  a  scatterplot  of  the  Engel  data  on  food  expenditure  versus  house- 
hold income,  along  with  quantile  regression  curves  with  the  qua.ntile  indices  {05,0.1, ..., 
0.95}.  We  see  that  the  quantile  regression  lines  become  closer  and  cross  at  low  values  of 
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income.  This  crossing  problem  of  the  Engel  curves  is  also  evident  in  Figure  4,  in  which 
we  plot  the  quantile  regression  process  of  food  expenditure  as  a  function  of  the  quantile 
index.  For  low  values  of  income,  the  quantile  regression  process  is  clearly  non-monotone. 
The  rearrangement  procedure  fixes  the  non-monotonicity  producing  increasing  quantile 
functions.  Moreover,  the  rearranged  curves  coincide  with  their  quantile  regression  coun- 
terparts for  the  middle  values  of  income  where  there  is  no  quantile-crossing  problem. 

In  Figure  5,  we  plot  simultaneous  90%  confidence  intervals  for  the  conditional  quantile 
function  of  food  expenditure  for  different  values  of  income  (at  the  sample  median,  and  the 
5%  percentile  of  income).  We  construct  the  bands  using  both  original  quantile  regression 
curves  and  rearranged  quantile  curves  based  on  500  bootstrap  repetitions  and  a  grid  of 
quantile  indices  {0.10,  0.11, ....  0.90}.  We  obtain  the  bands  for  the  rearranged  curves 
assuming  that  the  population  quantile  regression  curves  are  monotonically  correct,  so 
that  the  first  order  behavior  of  the  rearranged  curves  coincides  with  the  behavior  of  the 
original  curves.  The  figure  shows  that  even  for  the  low  value  of  income  the  rearranged 
bands  lie  within  the  quantile  regression  bands.  This  observation  points  towards  the 
maintained  assumption  of  the  monotonically  correct  case.  The  lack  of  monotonicity  of 
the  estimated  quantile  regression  process  in  this  case  is  likely  to  by  caused  by  sampling 
error. 

We  find  more  evidence  consistent  with  the  monotonically  correct  case  in  Figure  6,  in 
which  we  plot  the  simultaneous  confidence  bands  for  the  smoothed  quantile  regression 
and  rearranged  curves.  We  construct  the  band  by  bootstrapping  the  smoothed  curves 
(with  bandwidth  equal  to  .05).  The  bootstrap  bands  are  valid  for  the  smoothed  rear- 
ranged curves  even  in  the  monotonically  incorrect  case.  The  almost  perfect  overlapping 
between  the  confidence  bands  points  towards  the  monotonically  correct  case.  Interest- 
ingly, smoothing  reduces  the  width  of  the  confidence  bands,  but  does  not  completely 
monotonize  the  quantile  regression  curves. 

4.2.  Monte  Carlo.  We  use  the  following  Monte  Carlo  experiment,  matching  closely  the 
previous  empirical  apphcation,  to  illustrate  the  estimation  properties  of  the  rearranged 
curves  in  finite  samples.  In  particular,  we  consider  two  designs  based  on  the  location- 
scale  shift  model:  Y  =  Z{X)'a  +  {Z{X)'')')e,  where  e  is  independent  of  X,  with  the  true 
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Figure  3.  The  scatterplot  and  quantile  regression  fits  of  the  Engel 
food  expenditure  data.  The  plot  shows  a  scatterplot  of  the  Engel  data 
on  food  expenditure  vs.  household  income  for  a  sample  of  235  19th  cen- 
tury working-class  Belgium  households.  Superimposed  on  the  plot  are  the 
{0.05, 0.10,  ...,0.95}  quantile  regression  curves.  The  range  displayed  corre- 
sponds to  values  of  income  lower  than  1500  and  values  of  food  expenditure 
lower  than  800. 


conditional  quantile  function 

Qo{u\X)  =  Z{xya  +  {Z{Xy^)Q,{u). 

Design  1  includes  a  constant  and  a  regressor,  namely  Z{X)  =  {1,X);  and  design  2 
has  an  additional  nonlinear  regressor,  namely,  Z{X)  =  (1,.Y,  1{X  >  a}  ■  X),  where 
a  =  median(X).  We  select  the  parameters  for  designs  1  and  2  to  match  the  Engel 
empirical  example,  employing  the  estimation  method  of  Koenker  and  Xiao  (2002).  For 
design  1  we  set  a  =  (624,15,0.55)  and  7  =  (1,0.0013);  and  for  design  2  we  set  a  = 
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A.  Income  =  394  (1%  quantile) 


B.  Income  =  452  (5%  quantile) 


0.0         0.2         0.4         0.6 


C.  Income  =  884  (Median) 


D.  Income  =  2533  (99%  quantile) 


Figure  4.  Quantile  regression  processes  and  rearranged  quantile  pro- 
cesses for  the  Engel  food  expenditure  data.  Quantile  regression  estimates 
are  plotted  with  a  thick  gray  line,  whereas  the  rearranged  estimates  are 
plotted  in  black. 


(624.15,0.55,-0.003)  and  7  =  (1,0.0017,-0.0003).  For  each  design,  we  draw  1,000 
Monte  Carlo  samples  of  size  n  =  235.  To  generate  the  values  of  the  dependent  variable, 
we  draw  observations  from  a  normal  distribution  with  the  same  mean  and  variance  as 
the  residuals  e  =  (V  -  Z(A')'a')/(Z(X)'7)  of  the  Engel  data  set;  and  we  fix  the  regressor 
X  in  all  the  replications  to  the  observations  of  income  in  the  Engel  data  set. 

We  use  designs  1  and  2  to  assess  the  estimation  properties  of  the  original  and  rear- 
ranged quantile  regressions  under  the  correct  and  incorrect  specification  of  the  functional 
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A.  Income  =  452  (5%  quantile) 


B.  Income  =  884  (Median) 


Figure  5.  Simultaneous  90%  confidence  bands  for  quantile  regression 
processes  and  rearranged  quantile  processes  for  the  Engel  food  expenditure 
data.  Two  different  values  of  the  income  regressor  are  considered.  The 
bands  for  quantile  regression  are  plotted  in  light  gray,  whereas  the  bands 
for  rearranged  quantile  regression  are  plotted  in  dark  gray. 


form.  Thus,  in  each  replication,  we  estimate  the  model 

Q{u\X)  =  Z{Xyp{u),    Z{X)  =  {l,X). 

This  gives  the  correct  functional  form  for  design  1,  that  is,  Q{u\X)  =  Qo{u\X),  and  an 
incorrect  functional  form  for  design  2,  that  is  Q{u\X)  ^  Qo{u\X)  (due  to  the  omission  of 
a  nonlinear  regressor).  Accordingly,  estimation  error  for  design  1  arises  entirely  due  to 
sampling  error,  while  the  estimation  error  for  design  2  arises  due  to  both  sampling  error 
and  specification  error.  Regardless  of  the  nature  of  the  estimation  error,  Proposition  5 
establishes  that  the  rearranged  quantile  curves  should  be  closer  to  the  true  conditional 
quantiles  than  the  original  curves. 
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A.  Income  =  452  (5%  quantile) 


B.  Income  =  884  (Median) 


Figure  6.  Simultaneous  90%  confidence  bands  for  the  smoothed  quan- 
tile regression  processes  and  the  smoothed  rearranged  quantile  processes 
for  the  Engel  food  expenditure  data.  The  bands  for  the  smoothed  original 
curves  are  plotted  in  light  gray,  whereas  the  bands  for  the  smoothed  rear- 
ranged curves  are  plotted  in  dark  gray.  The  smoothed  curves  are  obtained 
using  a  bandwidth  equal  to  0.05. 


In  each  replication,  we  fit  a  linear  quantile  regression  curve  Q{u\X)  =  X  (3{u)  and 

monotonize  this  curve  to  get  F~^{u\X)  using  the  rearrangement  method.  Table  1  reports 

measures  of  the  estimation  error  of  the  original  and  rearranged  estimated  conditional 

quantile  curves  using  different  norms  {p  =  1,2,  3,4,  and  oo),  with  the  regressor  fixed  at 

a  value,  X  =  xq,  that  corresponds  to  the  5%  quantile  of  the  regressor  X  {X  =  452). 

We  select  this  value  motivated  by  the  empirical  example.  Each  entry  of  the  table  gives 

a  Monte  Carlo  average  of 

i/p 
U':={    /   \QQ{u\xo)-Q{u\xoWdu 
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for  Q{u\xo)  =  XqP{u)  and  Q{u\xo)  =  F~^{u\xq).  We  evaluate  the  integral  using  a  net  of 
indices  u  of  size  .01. 

Both  in  the  correctly  specified  case  and  in  the  misspecified  case,  we  find  that  the 
rearranged  curves  estimate  the  true  quantile  curves  Qo{u\X)  more  accurately  than  the 
original  curves,  providing  a  4%  to  15%  reduction  in  the  estimation/approximation  error, 
depending  on  the  norm. 

Table  1.  Estimation  Error  of  Original  and  Rearranged  Curves. 


Design  1 

Correct  Specification 

Design  2 

Incorrect  Specification 

Original 

Rearranged 

Ratio 

Original 

Rearranged 

Ratio 

L' 

6.79 

6.61 

0.96 

7.33 

7.02 

0.95 

L2 

7.99 

7.69 

0.95 

8.72 

8.20 

0.93 

L3 

8.93 

8,51 

0.95 

9.85 

9.12 

0.92 

L' 

9.70 

9.17 

0.94 

10.78 

9.86 

0.91 

L°° 

17.14 

15.32 

0.90 

19.44 

16.44 

0.85 

5.  Conclusion 

This  paper  analyzes  a  simple  regularization  procedure  for  estimation  of  conditional 
quantile  and  distribution  functions  based  on  rearrangement  operators.  Starting  from 
a  possibly  non-monotone  empirical  curve,  the  procedure  produces  a  rearranged  curve 
that  not  only  satisfies  the  natural  monotonicity  requirement,  but  also  has  smaller  esti- 
mation error  than  the  original  curve.  Asymptotic  distribution  theory  is  derived  for  the 
rearranged  curves,  and  the  usefulness  of  the  approach  is  illustrated  with  an  empirical 
example  and  a  simulation  experiment. 
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Appendix  A.  Proofs 

A.l.  Proof  of  Proposition  1.  First,  note  that  the  distribution  of  Y^  has  no  atoms, 
i.e., 

Pr[y^  =  y]  =  Fv[Q{U\x)  =  y]  =  Pr[[/  e  {ueU  -.u  is  &  root  of  Q{u\x)  =  y}]  =  0, 

since  the  number  of  roots  of  Q{u\x)  =  y  is  finite  under  (a)  -  (b),  and  U  ~  Uniform(W). 
Next,  by  assumptions  (a)-(b)  the  number  of  critical  values  of  Q{u\x)  is  finite,  hence 
claim  (1)  follows. 

Next,  for  any  regular  y,  we  can  write  F{y\x)  as 
/    l{Q{u\x)  <y]du=     J^      /  \{Q{u\x)  <y]du+  I  l{Q{u\x)  <  y]du, 

where  Uo{y\x)  :=  0  and  {uk{y\x),  for  /c  =  1,  ...,K{y\x)  <  oo}  are  the  roots  oiQ{u\x)  =  y 
in  increasing  order.  Note  that  the  sign  of  Q'{u\x)  alternates  over  consecutive  Uk{y\x), 
determining  whether  l{Q(y|a:)  <  y}  =  1  on  the  interval  [uk^i{y\x),Uk{y\x)].  Hence 
the  first  term  in  the  previous  expression  simplifies  to  Y^i^Iq  l{Q'{ui:+i{y\x)\x)  > 
0}{uk+i{y\x)  —  Uk{y\x));  while  the  last  term  simplifies  to  1{Q' {uK{y\x){y\x)\x)  <  0}(1  - 
'^K{y\x){y\->-))-  An  additional  simplification  yields  the  expression  given  in  claim  (2)  of  the 
proposition. 

The  proof  of  claim  (3)  follows  by  taking  the  derivative  of  expression  in  claim  (2),  noting 
that  at  any  regular  value  y  the  number  of  solutions  K{y\x)  and  sign{Q' {uk{y\x)\x))  are 
locally  constant;  moreover, 

>(  I   x_  sign{Q'{uk{y\x)\x)) 
'''^^^^^~      \Q'iu,{y\x)\x)\     ■ 

Combining  these  facts  we  get  the  expression  for  the  derivative  given  in  claim  (3), 

To  show  the  absolute  continuity  of  F{y\x)  with  /(y|x)  being  the  Radon-Nykodym 
derivative,  it  suffices  to  show  that  for  each  y'  G  3^x,  Jl^f{y\^)dy  =  J^^dF{y\x),  cf 
Theorem  31.8  in  Billingsley  (1995).  Let  V^^  be  the  union  of  closed  balls  of  radius  t 
centered  on  the  critical  points  y^^  \y*,  and  define  y^  =  yx\V{^.  Then,  J^  l{y  G 
yl}f{y\x)dy  =  /^^  l{y  e  yl}dF{y\x).  Since  the  set  of  critical  points  3^^,  \  y*  is  finite 
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and  has  mass  zero  under  F{y\x),  J^^l{y  G  yl}dF{y\x)  t  J^^dF{y\x)  as  t  ^  0. 
Therefore,  /_^;^  l{y  €  yi}f{y\x)dy  ]  r_'^  fiy\x)dy  =  J^;^  dF{y\x). 

Claim  (4)  follows  by  noting  that  at  the  regions  where  s  — >  Q{s\x)  is  increasing  and 
one-to-one,  we  have  that  F{y\x)  =  !Q^s\x)<y^^  =  ^s<Q-Hy\x)^^  =  Q~\y\^)-  Inverting 
the  equation  u  =  F [F~'^ [u\x)\x)  =  Q~'^ {F~^ [u\x)\x)  yields  F~^{u\x)  =  Q{u\x). 

Claim  (5).  We  have  Y^  =  Q{U\x)  has  quantile  function  F~^{u\x).  The  quantile 
function  of  a  -t-  (3Q{U\x)  =  a  +  (3Y^,  for  /3  >  0,  is  therefore  inf{y  :  Pr(a  +  PV^  <  y)  > 
u}  =  a  +  (3F~^{u\x). 

Claim  (6)  is  immediate  from  claim  (3). 

Claim  (7).  The  proof  of  continuity  of  F{y\x)  is  subsumed  in  the  step  1  of  the  proof  of 
Proposition  3  (see  below).  Therefore,  for  any  sequence  Xt  —^  x  we  have  that  F{y\xt)  — > 
F{y\x)  uniformly  in  y,  and  F{y\x)  is  continuous.  Let  Ut  —^  u  and  Xt  -^  x.  Since 
F{y\x)  =  u  has  a  unique  root  y  =  F~^{u\x),  the  root  of  F{y\xt)  =  Ut,  i.e.,  yt  = 
■F'^{ut\xt),  converges  to  y  by  a  standard  argument,  see,  e.g.,  van  der  Vaart  and  Wellner 
(1997).  D 

A. 2.  Proof  of  Propositions  2-7.  In  the  proofs  that  follow  we  will  repeatedly  use 
Lemma  1,  which  establishes  the  equivalence  of  continuous  convergence  and  uniform 
convergence: 

Lemma  1.  Let  D  and  D'  be  complete  separable  metric  spaces,  with  D  compact.  Suppose 
f  :  D  ^>  D'  is  continuous.  Then  a  sequence  of  functions  fn'-D^D'  converges  to  f 
uniformly  on  D  if  and  only  if  for  any  convergent  sequence  x^  ^>  x  in  D  we  have  that 
fni^n)  -^  f{x). 

Proof  of  Lemma  1:  See,  for  example,  Resnick  (1987),  page  2.  D 

Proof  of  Proposition  2.  We  have  that  for  any  ^  >  0,  there  exists  e  >  0  such  that 
for  u  e  B^{uk{y\x))  and  for  small  enough  t  >  0 

l{Qiu\x)  +  tht{u\x)  <y}<  l{Q{u\x)  +  t{h{uk{y\x)\x)  -  5)  <  y}, 

for  all  k  e  I, ...,  K{y\x);  whereas  for  ah  u  0  [JkB^{ui,{y\x)),  as  i  ^  0, 

l{Q{u\x)  +  tht{u\x)  <y}  =  l{Q{u\x)  <  y}. 
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Therefore, 

f^  l{Q{u\x)  +  tht{u\x)  <  y]du  -  J^  l{Q{u\x)  <  y}du 

^    ""y     f  l{Q{u\x)  +  t{h{u,{y\x)\x)  -  5)  <  y}  -  l{Q{u\x)  <  y} ^^ 

~-      k=l    JB,{uk(y\x))  i 

which  by  the  change  of  variables  y'  —  Q{u\x)  is  equal  to 

K{y\x) 


\Y. 


^  dy\ 


i     fc=l     J Jkn\y,v-t{Huk(y\x)\x)^6)]   \Q' [Q    ^{y'\x)\x 

where  Jk  is  the  image  of  B^{uk{y\x))  under  u  i-^  Q{-\x).    The  change  of  variables  is 
possible  because  for  e  small  enough,  Q{-\x)  is  one-to-one  between  B^{uf.{y\x))  and  J^. 

Fixing  e  >  0,  for  i  — >  0,  we  have  that  Jk  fl  [y,y  —  t{h{uk{y\x)\x)  —  5)]  =  [y,y  — 
t{h{uk{y\x)\x)  -  5)1  and  \Q'{Q-\y'\x)\x)\  -^  \Q' {uk{y\x)\x)\  as  Q-\y'\x)  ^  Uk{y\x). 
Therefore,  the  right  hand  term  in  (A.l)  is  ^lO  greater  than 

Y^  -h{uk{y\x)\x)  +  5 

^      \Q'{uk{y\x)\x)\     ^    ^  ^■ 

Similarly  ES'^^  lQ'"u[l£')t^^  +  ^(1)  bounds  (A.l)  from  below.    Since  5  >  0  can  be 
made  arbitrarily  small,  the  result  follows. 

To  show  that  the  result  holds  uniformly  in  {y,x)  E  K,  a  compact  subset  of  yP^*,  we 
use  Lemma  1.  Take  a  sequence  of  {yt,Xt)  in  K  that  converges  to  (y,x)  G  K,  then  the 
preceding  argument  applies  to  this  sequence,  since  (1)  the  function  {y,x)  i-^  \o'(u''(  \x)\x)\ 
is  uniformly  continuous  on  K,  and  (2)  the  function  (y,x)  i-^-  K{y\x)  is  uniformly  con- 
tinuous on  K.  To  see  (2),  note  that  K  excludes  a  neighborhood  of  critical  points 
{y\yx,x  €  ^),  and  therefore  can  be  expressed  as  the  union  of  a  finite  number  of  com- 
pact sets  [Ki, ...,  Km)  such  that  the  function  K{y\x)  is  constant  over  each  of  these  sets, 
i.e.,  K{y\x)  =  kj  for  some  integer  kj  >  0,  for  all  (y,  x)  E  Kj  and  j  G  {1, ...,  M}.  Likewise, 
(1)  follows  by  noting  that  the  limit  expression  for  the  derivative  is  continuous  on  each  of 
the  sets  {Ki, ...,  Km)  by  the  assumed  continuity  of  h{u\x)  in  both  arguments,  continuity 
of  Uk{y\x)  (implied  by  the  Implicit  Function  Theorem),  and  the  assumed  continuity  of 
Q'{u\x)  in  both  arguments.  D 
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Proof  of  Proposition  3.  For  a  fixed  x  the  result  follows  by  Proposition  2,  by  step 
1  of  the  proof  below,  and  by  an  application  of  the  Hadamard  differentiability  of  the 
quantile  operator  shown  by  Doss  and  Gill  (1992).  Step  2  establishes  uniformity  over 
xeX. 

Step  1.  Let  /C  be  a  compact  subset  of  3^^*.  Let  (y^,  Xt)  be  a  sequence  in  K,  convergent 
to  a  point,  say  (y,  x).  Then,  for  every  such  sequence,  Ct  :=  ^||/!-t||oo  +  ||<?(-kt)  — <5('k)||oo  + 
\yt  -  y|  -^  0,  and 

\F{yt\xu  ht)  -  F{y\x)\     <        f  [l{Q(n|x,)  +  tht{u\x)  <  yj  -  l{Q{u\x)  <  y}]du 

Jo 

<       f   l{\Q{u\x)-y\<et}du  -^0,  (A.2) 

Jo 

where  the  last  step  follows  from  the  absolute  continuity  of  y  i— >  F{y\x),  the  distribution 

function  of  Q{U\x).    By  setting  ht  =  0  the  above  argument  also  verifies  that  F{y\x) 

is  continuous  in  {y,x).   Lemma  1  implies  uniform  convergence  of  F{y\x,ht)  to  F{y\x), 

which  in  turn  implies  by  a  standard  argument^  the  uniform  convergence  of  quantiles 

F~^{u\x,ht)  -^  F~^{u\x),  uniformly  over  K* ,  where  K*  is  any  compact  subset  oiUX*. 

Step  2.  We  have  that  uniformly  over  A'*, 

F[F-\u\xM\:cM)-F[F-M-M)\x)  ^  ^^(^-:(,|,^  ,^)|,)  ^  ^^,^^ 

t  (A.3) 

=  Dh{F-\u\x)\x)  +  o{\), 

using  Step  1,  Proposition  2,  and  the  continuity  properties  of  Dh{y\x).  Further,  uniformly 
over  K* ,  by  Taylor  expansion  and  Proposition  1,  as  i  — >•  0, 

F{F-\u\xM)\x)  -  F{F-\u\x)\x)  ^  ^(j.-i(-^|^)|^)-^'^H^>  M  -  i^-HH^)  ^  ^^^^^ 

(A.4) 
and  (as  -will  be  shown  below) 

F{F-\u\x,ht)\x,ht)  -  F{F-\u\x)\x)  _^^^^  ■       ^^_^^ 

as  i  -*  0.    Observe  that  the  left  hand  side  of  (A. 5)  equals  that  of  (A.4)  plus  that  of 
(A.3).  The  result  then  follows. 


^See,  e.g.,  Lemma  1  in  Chernozhukov  and  Fernandez- Val  (2005). 
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It  only  remains  to  show  that  equation  (A. 5)  holds  uniformly  in  K*.  Note  that  for  any 
right-continuous  cdf  F,  we  have  that  u  <  F{F~^{u))  <  u  +  F{F~^{u))  —  F(F~^(n)  — ), 
where  F{  — )  denotes  the  left  limit  of  F,  i.e.,  F{xo—)  =  lim3;f3;o  F{x).  For  any  continuous, 
strictly  increasing  cdf  F,  we  have  that  F{F~^{u))  =  u.  Therefore,  write 
F{F-\u\x,ht)\x,ht)  -  F{F-\u\x)\x) 


0< 
< 
< 


t 

u  +  F{F~^{u\x,  ht)\x,  ht)  —  F{F~^{u\x,  ht)  -  |x,  ht)  —  u 

t 
F{F-\u\x,  ht)\x,  ht)  -  F{F-\u\x,  ht)  -  \x,  ht) 


t 

(^)  [F{F-\u\x,ht)\x,ht)  -  F{F-\u\x,ht)\x)] 

t 
[F{F-\u\x,  ht)  -  \x,  ht)  -  F{F-\u\x,  ht)  -  \x)] 

t 

'=^  DH{F-\u\x,ht)\x)  -  DhiF-\u\x,  ht)  -  \x)  +  o(l)  =  o(l), 

as  i  — V  0,  where  in  (1)  we  use  that  F{F~^{u\x,  ht)\x)  =  F{F~^{u\x,  ht)  —  \x)  since  F{y\x) 
is  continuous  and  strictly  increasing  in  y,  and  in  (2)  we  use  Proposition  2.  D 

The  following  lemma,  due  to  Pratt  (1960),  will  be  very  useful  to  prove  Proposition  4. 

Lemma  2.  Let  \fn\  <  Gn  and  suppose  that  fn  ^  f  o,nd  Gn  —>  G  almost  everywhere, 
then  if  J  Gn  —^  J  G  finite,  then  J  fn  —*  J  f  ■ 

Proof  of  Lemma  2.  See  Pratt  (1960).  D 

Lemma  3  (Boundedness  and  Integrabihty  Properties).   Under  the  hypotheses  of  Propo- 
sition 2  and  3,  we  have  that  for  all  {y,x)  E  yX : 

\DhMx,t)\<\\ht\U,  (A.6) 

and 

Jo  f 

where  for  any  Xt^>-xEPi,ast—^Q, 

A{y\xt,t)  -^  2\\h\\^f{y\x)  for  a.ey  ey  and  /  A{y\xt,t)dy  -^  /  2\\h\\o^f{y\x)dy. 

Jy  Jy 
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Proof  of  Lemma  3.  To  show  (A. 6)  note  that 

sup     |5ft,(y|x,t)|  <  ||/ii||oo  (A.8) 

immediately  follows  from  the  equivariance  property  noted  in  Claim  (5)  of  Proposition  1. 

The  inequality  (A. 7)  is  trivial.  That  for  any  Xf  — >  x  G  X,  lS.{y\xt,t)  -^  2||/i||oo/(y|a;) 
for  a.e  y  E  y  follows  by  applying  Proposition  2  respectively  with  functions  h[{u\x)  = 
||/).t||oo  and  h'f{u,x)  =  —\\ht\\oo  (for  the  case  when  f{y\x)  >  0;  and  trivially  otherwise). 
Similarly,  that  for  any  yt  ^>  y  ^y,  /S.{yt\x,  t)  -^  2||/i||oo/(y|a;)  for  a.e  x  ^  A!  follows  by 
Proposition  2  (for  the  case  when  f{y\x)  >  0;  and  trivially  otherwise)  . 

Further,  by  Fubini's  Theorem, 


=  :  /t(") 

Note  that  ft{u)  .<_.2||/it||oo.  Moreover,  for  almost  every  u,  ft{u)  =  2||/!.t||oo  for  small 
enough  i,  and  2||/i4||(x)  converges  to  2||/i||oo  as  t  -^  0.  Then,  trivially,  2  J^  ||/ii||ooo!w  — > 
2||^||oo.  By  Lemma  2  the  right  hand  side  of  (A. 9)  converges  to  2||/i||tx)-  □ 

A. 3.  Proof  of  Proposition  4.  Define  mt{y\x,y')  :=  l{y  <  y'}g{y\x)Dht{y\x,t)  and 
m{y\x,y')  :=  l{y  <  y'}g{y\x)Dh{y\x).  To  show  claim  (1),  we  need  to  demostrate  that 
for  any  y[  — >  y'  and  x^  — >  x 

/  mt{y\xt,y't)dy  ^   /  m{y\x,y')dy,  (A.IO) 

Jy  Jy 

and  that  the  limit  is  continuous  in  {x,y').  We  have  that  \mt{y\xt,yt)\  is  bounded,  for 
some  constant  C,  by  CA(y|xt,  t)  which  converges  a.e.  and  the  integral  of  which  converges 
to  a  finite  number  by  Lemma  3.  Moreover,  by  Proposition  2,  for  almost  every  y  we  have 
Tnt{y\xt.,y[)  -^  m{y\x,y').  We  conclude  that  (A.IO)  holds  by  Lemma  2. 

In  order  to  check  continuity,  we  need  to  show  that  for  any  y[  — »  y'  and  Xt  — >  x 

m{y\xt,yt)dy -^       m{y\x,y')dy.  (A.ll) 

Jy  Jy 

We  have  that  m{y\xt,y[)  ^  m(y|x,y')  for  almost  every  y.     Moreover,  m{y\xt,yt)  is 

dominated  by  \\g\\oo\\h\\oof{y\xt),  which  converges  to  ||5||oo||^||oo/(y|3;)  for  almost  every 
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y,  and,  moreover,  /^  \\g\\o^\\h\\^f{y\x)dy  converges  to  ||g||oo||/i||cx:>-  Conclude  that  (A.ll) 
holds  by  Lemma  2. 

To  show  claim  (2),  define  m^t{'<^\x,u')  =  l{u  <  u'}g{u\x)Dht{u\x)  and  m{u\x,u')  = 
l{u  <  u'}g{u\x)Dh{u\x).  Here  we  need  to  show  that  for  any  u[  -^  u'  and  Xt  —^  x 

/  mt{u\xt,u[)du  —^    /  m{u\x,u')du,  (A. 12) 

Ju  Ju 

and  that  the  limit  is  continuous  in  {u',x).  We  have  that  mt{u\xt,u[)  is  bounded  by 
^(ulxj)!!/?,^!!^),  which  converges  to  g'('u|a:)||/i||oo  for  a.e.  u.  Furthermore,  the  integral  of 
5(u|a:t)||/it||oo  converges  to  the  integral  of  5^(u|a;)||/i||oo  by  the  dominated  convergence 
theorem.  Moreover,  by  Proposition  2,  we  have  that  mt{u\xt.,u[)  — >  m{u\x,u')  for  almost 
every  u.  We  conclude  that  (A.  12)  holds  by  Lemma  2. 

In  order  to  check  the  continuity  of  the  limit,  we  need  to  show  that  for  any  u[  — >  u' 
and  Xf  — >  X 

/  m{u\xt,u[)du  ^^   /  m{u\x,u')du.    -    ^,  (A. 13) 

Ju  Ju 

We  have  that  m{u\xt,u[)  — >  m{u\x,u')  for  almost  every  u.  Moreover,  for  smaU  enough 
t,  m{u\xt,u[)  is  dominated  by  |g(u|3;j)|||/i||oo,  which  converges  for  almost  every  value  of 
u  to  |5(u|a;)|||/i||oo  as  t  -^  0.  Furthermore,  the  integral  of  |5(ti|xt)|||/!.||oo  converges  to  the 
integral  of  |g'(u|a;)|||/i||oo  by  dominated  convergence  theorem.  We  conclude  that  (A. 13) 
holds  by  Lemma  2.  D 

The  following  lemma  will  be  used  to  prove  Proposition  5: 

Lemma  4.  Assume  that  Q{u)  is  a  function  mapping  U  :=  (0, 1)  to  K,  a  bounded  subset 

of  R,  and  that  Qo{u)  is  a  non- decreasing  function  mupping  U  to  K .     Think  of  Q{u) 

as  an  approximation  to  Qq{u).   Let  FQ[y)  =  J^l{Q{u)  <  y}du  denote  the  distribution 

function  of  Q{U)  when  U  ~  (7(0,1).    Let  Q*{u)  =  Fq\u)  =  inf{y  G  M  :  Fqiy)  >  u}. 

Then,  for  any  p  G  [1,  oo], 

i/p       r  r  1  i/p 

< 


\Qoiu)~Q*{u)fdu 
uu 


\Q,{u)  -  Q{u)f  du 
uu 


Moreover,  this  inequality  is  strict  provided  (1)  p  G  (l,oo),  (2)  Q{u)  is  decreasing  on 
a  subset  of  U  that  has  positive  Lebesgue  measure,  and  (3)  the  true  function  Qo{u)  is 
increasing  on  U. 
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Proof  of  Lemma  4.  A  direct  proof  of  this  lemma  is  given  in  Proposition  1  of 
Chernozhukov,  Fernandez- Val,  Galichon  (2006a).  It  is  helpful  to  give  a  quick  indirect 
proof  of  the  weak  inequality  contained  in  the  lemma  using  the  following  inequality  due 
to  Lorentz  (1953):  Let  Q  and  G  be  two  functions  mapping  U  to  K,  a  bounded  subset 
of  R.  Let  Q*  and  G*  denote  their  corresponding  increasing  rearrangements.  Then,  we 
have 


/  L{Q*{u),G*{u))du<  /  L{Q{u),G{u))du, 

for  any  submodular  discrepancy  function  L  :  R^  i— >•  E+  In  our  case,  G{u)  =  Qq{u)  = 
G*{u)  =  Qoiu)  almost  everywhere.  Thus,  the  true  function  is  its  own  rearrangement. 
Moreover,  L{v,w)  =  \w  —  v\^  is  submodular  for  p  e  [l,oo).  For  the  proof  of  the  strict 
inequality,  please  refer  to  Chernozhukov,  Fernandez- Val,  Galichon  (2006a),  Proposition 
1.  For  p  =  oo,  the  inequalities  follows  by  taking  limit  as  p  — >  oo.  D 

A. 4.  Proof  of  Proposition  5.  This  proposition  is  an  immediate  consequence  of  Lemma 
4.  D 

A. 5.  Proof  of  Proposition  6.  This  Proposition  simply  follows  by  the  functional  delta 
method  (e.g.  van  der  Vaart,  1998).  Instead  of  restating  what  this  method  is,  it  takes 
less  space  to  simply  recall  the  proof  in  the  current  context. 

To  show  the  first  part,  consider  the  map  gn{y,x\h)  —  y/n{F{y\x,n'^^'^h)  -  F{y\x)). 
The  sequence  of  maps  satisfies  gn'iy,x\hr,')  — >  Dh{y\x)  in  £'^{K)  for  every  subsequence 
hn'  -^  h  in  (.°°{UX*),  where  h  is  continuous.  It  follows  by  the  Extended  Continuous  Map- 
ping Theorem  that,  in  £°°{K),  gn{y,  x\y/n{Q{u\x)  -  Q{u\x)))  =»  Ddylx)  as  a  stochastic 
process  indexed  by  (y,x),  since  ^{Q{u\x)  -  Qiu\x))  =>  G{u\x)  in  £°°{K). 

Conclude  similarly  for  the  second  part.  D 

A. 6.  Proof  of  Proposition  7.  This  follows  by  the  functional  delta  method,  similarly 
to  the  proof  of  Proposition  6.  D 
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